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Abstract 

During this age of data proliferation, heavy reliance is placed on data visualisation to support users in 
making sense of vast quantities of information. Informational Dashboards have become the must have 
accoutrement for Higher Education institutions with various stakeholders jostling for development priority. 
Due to the time pressure and user demands, the focus of development process is often on designing for each 
stakeholder and the visual and navigational aspects. Dashboards are designed to make data visually 
appealing and easy to relate and understand; unfortunately this may mask data issues and create an 
impression of rigour where it is not justified. This article proposes that the underlying logic behind current 
dashboard development is limited in the flexibility, scalability, and responsiveness required in the 
demanding landscape of Big Data and Analytics and explores an alternative approach to data visualisation 
and sense making. It suggests that the first step required to address these issues is the development of an 
enriched database which integrates key indicators from various data sources. The database is designed for 
problem exploration allowing users freedom in navigating between various data-levels, which can then be 
overlaid with any user interface for dashboard generation for a multitude of stakeholders. Dashboards 
merely become tools providing users and indication of types of data available for exploration. A Design 
Research approach is shown, along with a case study to illustrate the benefits, showcasing various views 
developed for diverse stakeholders employing this approach, specifically the the Digital Decision Network 
Application (DigitalDATA) employed at Unisa. 

Keywords: dashbnoards, big data, management information systems, data-use 


Introduction 

The aim of the article is to explore sound approaches of meeting stakeholder data sense-making 
requirements in the era of over-whelming data demand and supply, while maintaining data integrity, 
consistency, and flexibility. Traditionally higher education institutions have access to relatively large data 
sets and tools for analysis. This is growing exponentially with the ever increasing amount of digital student 
data that can be harvested and analysed, as well as increased technological and analytical capabilities 
(Wishon & Rome, 2012). Analytics has been described as the “new black” (Booth, 2012), and student data 
as the “new oil” (Watters, 2013). The 2013 NMC New Horizon report: Higher Education Edition (New 
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Media Consortium, 2013) identifies learning analytics as one of the key emerging technologies to enter 
mainstream use from 2015-2016. The report (New Media Consortium, 2013) also identifies dashboards as 
a key technology in leveraging the power of data at all stakeholder levels. Unfortunately, the current focus 
and approach to dashboard development will not be able to meet the rapidly growing demands of 
supporting sense-making in the world of big data, particularly in the Higher Education (HE) environment. 

The authors therefore suggest a paradigm shift embracing much of current leadership thinking, which 
focusses on a Complex Adaptive Systems (Choi, Dooley, & Rungtusanatham, 2001; Hodgson, 2016; 
McGreevy, 2008). This allows for more responsive organisations and embraces futurist thinking about 
education, the changing face of employment and graduateness (Archer & Chetty, 2013; Bridgstock, 2009; 
Hodgson, 2016; Yorke, 2011). This being said, the focus of this paper is not the debate around Complex 
Adaptive Systems (CAS) in leadership and management, as this has been extensively debated (For example 
Choi et al., 2001; Davis & Blass, 2007; Dooley, 1997; Dougherty, Ambler, & Triantis, 2016; McGreevy, 2008; 
Van der Merwe & Verwey, 2008). The article focuses on how various Higher Education (HE) stakeholders, 
particularly in an Open and Distance Environment, may be provided, with timely, appropriately, quality 
assured, and flexibly represented information. The purpose of such an approach would be to equip 
stakeholders to deal with the dynamic nature of and the constantly increasing demands made of Higher 
Education globally, as well as nationally (Department of Education [DoE], 1997; UNESCO, 1998, 2015), and 
Open and Distance Learning in particular (Department of Higher Education and Training [DHET], 2014). 

The case is illustrated employing data pertaining to HE teaching and learning, but is also already being 
applied in the context of Research, as well as Estate and Space data at Unisa. The paper should not be 
confused with work on learning analytics, but relates to information sense-making approaches in HE 
(which may include benefits in the fields of Learning Analytics and Academic Analytics, but are not limited 
to these). In particular a novel approach is suggested which questions the foundational principles and 
dominant current thinking around Dashboard design that focuses on visualization and memorability (Abd- 
elfattah, Alghamdi, & Amer, 2014; Borkin et al., 2013; Cook, Grinstein, & Whiting, 2014; Few, 2006, 2007; 
Newman & Elbourne, 2004). 

The approach is particularly powerful as it ensures a centralised, quality controlled database, which allows 
users at many stakeholder levels the flexibility and autonomy to find and explore data in diverse ways to 
investigate trends and possible contributors to such trends, as well as possible avenues for solutions. The is 
congruent with a CAS approach to leadership in the dynamic, and currently highly volatile environment of 
Higher Education globally and in South Africa (#feesmustfall & #rhodesmustfall) (Council on Higher 
Education, 2016; Pillay, 2016). CAS allows for responsiveness through: emergence, contextually sensitive 
co-evolution, being sub-optimal (good enough), embracing variety for creativity, employing connectivity 
within nested systems, simplicity of rules, as well as constant iterations of self-organisation on the spectrum 
between equilibrium and chaos. These principles are embodied in the DigitalDNA system illustrated in this 
article. 
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Rationale 

With the exponential growth of data, with multiple complex relationships, differing constraints, the ever 
changing environment, uncertainty and time pressure, problem-solving and decision-making can be 
overwhelming. Dashboards have emerged as a technology to enable stakeholders to engage with the data 
effectively through various data visualisations (Abd-elfattah et al., 2014). A dashboard can be defined as: “a 
visual display of the most important information needed to achieve one or more objectives; consolidated 
and arranged on a single screen, so the information can be monitored at a glance” (Few, 2006, p. 26). 
Dashboards started to gain traction in the 1980’s in the business world and have since gained popularity in 
Higher Education (Newman, Thomas, & Webber, 2014). Many tools have been developed for the purpose 
of designing dashboards and disseminating information in graphic formats, these include IBM Cognos, 
Oracle BI Foundation Suite, SAS Enterprise Intelligence Platform, SAP Business Object BI Platform, 
MicroStrategy, QlikView, and Web Focus, amongst others. Verbert, Govaerts, Duval, Parra, and Klerkx 
(2014, p. 1505-1506) document and compare 24 different Learning Analytics ashboards alone. 

The proliferation of tools and demands by various stakeholders for immediate development of their own 
personal dashboards, has often lead to a focus on the visualisation design aspects of the dashboards or the 
particular functionality of the tools at the expense of the message being communicated. Dashboards are by 
definition custom designed and as such the design process often starts with determining the stakeholder 
needs and elements to be present in the dashboard. Once this is determined, the required data is accessed, 
transformed, and integrated from various data sources in order to provide the data for the visualisation. 
This results in a multitude of smaller enriched data sources, each designed to service a particular dashboard. 
This proliferation makes it more complex to apply changes to business rules consistently, be responsive, 
manage data quality, and remain flexible as user needs change. (Abd-elfattah et al., 2014; Few, 2006; 
Newman et al., 2014) 

Another aspect of data dashboards which has received growing attention, is that of memorability (Borkin 
et al., 2013). These authors (Borkin et al., 2013) state that memorability is intertwined with cognition and 
understanding and an important step in designing visualisations such as dashboards. 

From a design perspective a distinction is drawn between visualisations for learning analytics and for 
broader applications as decision support systems (DSS). While there are some resently published examples 
of working dashboards for learning analytics (Verbert, Govaerts, Duval, Luis Santos et al., 2014; Verbert, 
Duval, Klerkx, Govaerts, & Luis Santos, 2013; Govaerts, Verbert, Duval, & Pardo, 2012; Govaerts, Verbert, 
Klerkx, & Duval, 2010), and while there are many publications that describe the frameworks used in 
complex systems (see the summary by Sokolova, Fernandez-Caballero, & Gomez, 2011), there are relatively 
few illustrating effective visualizations and design interfaces in the area of DSS, they appear to be more 
conceptual (Fakeeh, 2015; Susnea, 2013; Bresfelean, Lacurezeanu, Ani, & Pop 2009; Bresfelean, Ghisoiu, 
Lacurezeanu, & SitarTaut, 2009) 

While the DigitalDNA development has the student as an identified agent, the construct is more comparable 
with DSS development than with learner analytics visualisations. A DSS can be described as any tool 
designed to enhance the process of decision making in intricate or complex systems, and are used 
predominantly where information is “indecisive” or “partial” (Fakeeh, 2005). The DigitalDNA 
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development can be classified as part of the Data-driven DSS toolset as described by Kacprzyk and Zadrozny 
(2007). 

Bresfelean and Ghisoiu (2010) present: (1) the key figures of the DSS definitions and fundamental 
characterizations of DSS, (2) state of the art research in decision support and DSS for socio-economic areas, 
(3) a newly formulated decisional model, and (4) the latest DSS classifications from the research literature. 
The DigitalDNA development follows a similar pattern but attempts to move away from the traditional 
“drill-down” and “drill-through” approach towards a comprehensive collection of interconnected data 
points and data sets contained within a navigation framework that recognizes familiar decision pathways 
(e.g., the student walk). 

In this article, an alternate approach is explored where the focus is on establishing a comprehensive, highly 
navigable Decision Network Application, known as DigitalDNA , which allows users to navigate and 
interrogate data in real-time to address their dynamic informational needs. The focus is on designing and 
integrating enriched data sets with an intuitive navigation system, allowing users to easily move between 
various data nodes and explore each node in detail to gain a comprehensive understanding of the real-time 
data relating to their specific challenge. Data presented is as close to real-time as possible, focusing rather 
on allowing problem identification and exploration at will, as opposed to memorability and visualisation. A 
prime objective is to link points of interst in a way that the “distance” beteween data points is all but 
eliminated and more extended information is available on demand. 


Contextualisation 

This article explores the case of the University of South Africa (Unisa) where an alternative approach is 
employed to data visualisation. The University of South Africa is one of the mega Open Distance Learning 
(ODL) institutions in the world with more than 360,000 students and a range of courses and programmes. 
Unisa already has access to a staggering amount of student data, hosted in disparate sources, and governed 
by different processes. The data covers the entire student journey from admissions and registrations, to 
learning activities, course success, retention, graduation, employment, and citizenship. The existing data 
sources lie in various operational systems maintained by various functional units within the institution. The 
disparate systems typically have grown from the need of different functional areas to have customised 
functionality built into operational systems. These developments take place without a reporting or analytic 
objective in mind with the result that leveraging and integrating sensible information from these sources 
becomes problematic. (Prinsloo, Archer, Barnes, Chetty, & Van Zyl, 2015) 

As the university moves to mainstreaming online learning, the amount of data and need for analyses of data 
are increasing, raising important questions regarding our assumptions, understanding, data sources, 
systems and processes (Prinsloo et al., 2015). This exemplary case documents the development of an 
enriched central database (through Design research), created to extract, combine, transform, and house as 
many possible data elements which may have value to a diverse body of users. This database serves as the 
basis for many different dashboards, serving the needs of stakeholders ranging from students, staff, student 
support services, to management. This approach allows for greater consistency of data presentation and 
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rule application across various views and shifts the focus from the dashboard software capabilities to the 
core principles of relevance, clarity, data quality, consistency, and ability to explore. It also allows for greater 
independence from any individual dashboard design tool. 


Theoretical Framework 

The theoretical framework employed in this study is that of Complex Adaptive Systems. The length of the 
article precludes an extensive explanation, but the foundational principles are illustrated and related to the 
study below in Table l. 

Table l 


Emergent Charecteristics of Complex Adaptive System Principles Applied to the DigitalDNA System 


Principle 

Explanation 

Operationalisation in 
DigitalDNA 

Emergence 

What appears as randam interactions 
between agents in the system are 
patterns which informs the actions of 
the agents and system itself. 

Data points and sets are linked via common 
attributes that may not necesarily be 
structural or heirarchichal. 

Co¬ 

Systems exist within environments, to 

Information is intergrated with existing sets 

evolution 

ensure fit they must change with the 
environment which in turn adapts to 
the system. 

on the basis of “related data” within the 
“student walk” process. 

Sub-optimal 

The system should be good enough, 
not perfect, as the goal is efficiency 
within constant change. 

Since the first draft of this article through the 
review process the system has evolved and 
several new functionalities developed. 

Requisite 

Ambiguity, paradox, and 

Variety is constained only by the underlying 

variety 

contradictions to create new 

possibilities, so variety is essential for 
ingenuity and creativity. 

avaialability of data recorded for each process 
and the questions posed. 

Connectivity 

Relationships between agents are 
critical to the systems survival, these 
represent the patterns which ensure 
the survival of the system. 

Connectivity is the key driver of the system, 
“connectedness” is determined largely by the 
user and not by organisational structure. 

Simple rules 

The rules governing system 

functioning are simple even if the 
patterns are varied and rich. 

Navigation “rules” follow the known and 
possible decisions around the “student walk.” 

Iteration 

Small changes can have significant 
impact through the emergence of 
feedback loops. 

An attempt on facilitating feedback loops is 
contained in the navigation design and is 
currently focus of further attention. 
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Self- No hierarchy, command, or control, (see the point on Emergence above) 

organising just constant organising to find the 
best fit. 

The systems exist on a spectrum The system currently runs the risk of 
ranging from equilibrium to chaos, “information overload” to the user, further 
with the edge of chaos representing the developments in this regard will consider 

most variety and creativity. Rules and mapping a suite of “decision pathways” to 

restrictions ensure some address this issue, 

predictability. 

Nested Systems are nested in other systems This is facilited through the integration of 

systems with systems often being smaller sub- various data elements but connected via 

systems within larger systems. relationships and not structures. 


Edge of 
chaos 


Table l above presents the principles linearly while Figure l below show the iterative, complex and systemic 
nature of this approach. This paper will not extensively deal with CAS (For further reading on CAS see Choi 
et al., 2001; Dooley, 1997; Dougherty et al., 2016; Lewin & Regine, 1999; McGreevy, 2008). The feedback 
loops and iterative nature of CAS is better illustrated in Figure 1 below. 



Complex Adaptive Systems 


compete 

cooperate 

collaborate 


Strategizing and Modeling 
goal 

(viability) 

prediction about 
future state of . 

system & 
environment 



fitness to the 
system and 
environment 


combinations of 
building blocks 


collectively 

yield 



Environment 

multi-faceted, non¬ 
level playing field 
creates bias 




emergent 
system 
behavior and 
a changed 
environment 


'environment 
also fluctuates 
independently of 
system’s 
influence 

e.g., parcelleation 
speeds evoutipn 


Figure l. Illustration of a Complex Adaptive System (Innovation Labs Strategic Solutions, 2007). 
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Methodology 

This article employs the insights, experience, and thoughts of researchers faced with the often 
undocumented realities of engaging with vast amounts of data and providing actionable information to 
stakeholders. A methodology that was congruent with the conceptualization of the DigitalDiVA system in 
terms of its responsive and dynamic nature which emphasizes contextual sensitivity was essential. This 
study thus employed Design Research which has its root in educational research, particularly curriculum 
and technology design (Plomp, 2013). As the name implies, the purpose of Design Research is to blend 
design and research. Design Research “seeks to increase the impact, transfer, and translation of... research 
into improved practice” (Anderson & Shattuck, 2012, p. 16). 

Design Research results in two distinct outcomes: 1) an intervention or product to address the issue being 
studied, and 2) a set of design principles to implement the intervention in other contexts. These principles 
constitute the contribution to the academic domain and allow for transferability (Barab & Squire, 2004; 
Collins, Joseph, & Bielaczyc, 2004; Herrington, McKenny, Reeves, & Oliver, 2007; Nieveen & Folmer, 
2013). Design Research follows a cyclical iterative approach to design, development and implementation, 
which informs each subsequent cycle of design moving through three distinct phases of research: 
preliminary phase, development phase, and assessment phase (Archer & Howie, 2013; Herrington et al., 
2007; Plomp, 2013). In terms of ensuring the quality of the research, the phases shift from examining the 
context and requirements, to developing various aspects of the intervention and product and, finally, 
evaluating the completed intervention or product as a whole. The quality criteria employed are relevance 
(content validity), consistency (construct validity), expected and actual practicality, and expected and actual 
effectiveness (Nieveen & Folmer, 2013; Plomp, 2013; Van Den Akker, Gravemeijer, McKenney, & Nieveen, 
2006; Van den Akker, 2013; Wang & Hannafin, 2005). Design Research emphasises the importance of 
evaluation and, in particular, the use of both experts and users in the evaluation of each cycle. The approach 
also provides guidance on how the focus of the evaluation should shift during the phases by providing 
quality criteria. The process is illustrated in succinctly in Figure 2 below, including the research questions 
focused on in each phase. 
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Figure 2 . Design Research Process for DigitaLDATA, including research questions and Evaluative Foci. 

The following section describes the final prototype and its application in the Higher Education Context. An 
exemplary case study is employed for this purpose. 

Exemplary Case 

The rest of the articles is devoted to illustrating the product of the Design Research process, DigitalDNA, as 
well as the design principles or contribution to the academic body of knowledge, allowing for transferabitity. 
This is accomplished by means of an example of how the Digital Decision Network Application, known as 
DigitaLDATA, can be used to facilitate real-time data exploration will be presented in order to illustrate how 
the central, enriched database can be leveraged to provide actionable, timely data to stakeholders with 
diverse needs. We will launch the discussion by discussing the design logic employed in the development of 
this data exploration environment, followed by an example of the application and illustration of the node 
access screens (dashboards), as well as a discussion of the various data exploration approaches. 

Design Logic 

The DigitaLDAA development started off with the following tentative global design principles established 
through contextual needs analysis at Unisa and literature review which constituted the Preliminary Design 
Research Phase: 

1. The database should be aggregated and built up from the lowest level of granularity (in the case of 
our example, the student). 

2. The database should be enriched with additional variables and calculations which are often 
required in exploration and reporting. 
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3. Data will exist in layers as they relate to key nodes. 

4. Users must be able to enter DigitaLDATA at any layer through one of the key nodes. 

5. Users must be able to explore each layer in depth and move between layers at will to extend their 
exploration. 

6. It must be possible to aggregate and disaggregate data at various levels and for various purposes. 

In this illustrative case we will examine data with pertaining to teaching and learning. DigitaLDATA in this 
case is layered around three entry nodes (See Figure 3): 

1. Student. 

2. Module. 

3. Qualification. 



k 


>2 

( ‘ I * 1 

Student 


V / 

hTt 




Module 



Qualification 


Figure 3. DigitaLDAk 4 data sphere. 

DigitaLDATA holds the real-time, enriched and quality assured data warehouse. Users can enter DigitaLDATA 
to start their exploration at any one of these nodes using either a student identifier (e.g., name, date of birth, 
student number), module identifier (e.g., module code, module name, navigation through the curriculum 
structure), or qualification identifier (e.g., qualification code, qualification name, navigating through the 
organisational architecture). Once a user has entered any of the layers, they are presented with a dashboard 
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which provides key information for the particular student, module, or qualification they are exploring; a 
signboard to orientate the user to what is available for exploration in that layer of data, based on a certain 
level of granularity. These dashboards act as the layout map for each level to show what aspects can be 
explored at the particular level of granularity. Once a user has engaged with the primary dashboard for the 
level they can click on various aspects that they wish to explore further, either moving through data on the 
same level (drill through, 360°), or shifting their exploration of the data to another level (drill down or roll 
up). Moving through and between levels is facilitated through catalysts in the form of student, module, and 
qualification identifiers. 

The idea is similar to exploring a multi-storey department store (see Figure 4). Each node (student, module, 
or qualification) would represent a different department (data-level). Each department is located on its own 
floor. You navigate between the floors (data-levels) by using the elevator (catalysts - student, module, and 
qualification identifiers). When you reach the specific store you need to select which department (data 
layer) to go to and use the lift to go to the relevant floor by using your entry key (node identifier). Once the 
elevator door opens you are confronted with a sign-board showcasing what is available on that particular 
floor and you can decide which areas to visit (floor map or node dashboard). When you have finished on a 
particular floor, you may want to move to another floor and thus, use the elevator (catalysts or node 
identifiers: student, module, or qualification identifiers) again to navigate to a new floor, to explore what is 
available in more depth. We will now illustrate how Digital .DNA can be employed through an enrolment 
management example relevant to the higher education environment. 



Figure 4. DigitaLDiVA as a multi-storey data department store. 
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Scenario 

A college (faculty) planning workshop is taking place. The purpose of the meeting is to establish 
qualification enrolment targets for the 2016 cohort. 

The workshop is attended by the: 

• Top Management: Institutional development and Academic Planning, 

• College level: Executive Directors, Deputy Deans, Heads of Schools, Heads of Departments 

• Support: Strategic Planning, Quality Assurance, Information Analysts Purpose - setting enrolment 
targets for each qualification for 2016 cohort 

Initial discussions and engagement has resulted in the development of draft targets which have been 
captured on a planning spreadsheet to capture targets from 2016 to 2019. These targets now need to be 
interrogated to examine if they are feasible and where action is required to achieve the targets. 

View Point Identification 

A data exploration will now take place using the data DigitaLDiVA to determine which qualification targets 
may be problematic and to explore how to either achieve these targets or adjust the targets. An example of 
such a spreadsheet is provided in Figure 5 below. 


Qual 

Code 

New Qualification Name Strategy 

2010 

2011 

2012 

2013 

201 

\ 2015 

2016 

98615 

HIGHER CERTIFICATE IN ADULT BASIC EDUCATION AND Growth strategy 

2709 

3721 

5735 

11318 

1064 

| 9307 

13084 

98999 

MASTER OF EDUCATION in Open and Distance Learning Sustainable strategy 




13 

1 

’ 19 

15 

99001 

DIPLOMA in Adult Basic Education and Training Declining strategy 

4460 

4608 

3314 

2206 

172 

i 1316 

1006 

0264X 

ADVANCED CERTIFICATE IN EDUCATION (FOUNDATION Phased out 

106 

13 

4 

1 



0 

0328X 

ADVANCED CERTIFICATE IN EDUCATION: TOURISM EDU Phased out 

37 







0376X 

ENDORSEMENT: SPECIALISATION IN INCLUSIVE EDUCA1 Phased out 

14 

25 

21 




0 

2032X 

DIPLOMA IN EDUCATION (SECONDARY PHASE) Phased out 

1 








Figure 5. Excerpt of Qualification History and Targets Set for 2016. 

The data exploration approach employed in this case emphasises identification of problem areas and then 
diagnostic exploration to have comprehensive data to facilitate planning and decision making. As such, the 
first step is to identify the targets that warrant further exploration in this case. 

From Figure 5 it can be seen that the targets envisaged for the Qualification 98615 (Higher Certificate in 
Adult and Basic Education and Training) may be problematic given the recent enrolment history. Whilst 
enrolment increased steadily from 2010 to 2012 and showed a jump in 2013, there has been a steady 
decrease in enrolments since 2014. The new enrolment targets require a more than 40% growth, not only 
altering the trend of decreased enrolment, but requiring higher intakes than have been achieved in the last 
six years. This proposal should thus be explored in greater depth and seen from a range of different 
perspectives to determine if the proposal is reasonable. As we are using the new data DigitaLDiVA 
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exploration approach we need not request a full examination by our Information and Analysis Directorate, 
but merely request that the analyst, present in the session, uses DigitaLDiVA to explore the data for this 
qualification in real-time so that we may engage with it. 

Possible Viewpoints 

In exploring this qualification we want to examine a number of aspects in order to make and informed 
decision and plan actions to achieve our targets. These aspects can be seen as possible viewpoints during 
our data exploration journey (see Table 2). 

Table 2 

Possible Viewpoints for Exploration 



Viewpoints 


Possible question 

1 

Equivalents 

• 

How do previous versions of the qualification contribute to 
enrolments and graduates? 

2 

Attrition analysis 

• 

What is the pool of interest and potential uptake? 

3 

Qualification Flow Planning 

• 

What are the inflows and outflows -intake, first time intake, 
returning, dropouts and graduates? 



• 

How many provisional enrolments are required to achieve 
the official census date targets 1 and what is the required 
workload? 

4 

Cohort 

• 

What is the spatial distribution of these students? 



• 

What is the race, gender, matric score, age distribution of the 
students in this qualification? 

5 

Risk Management 

• 

What risk aspects are defined for this qualification? 



• 

What modules are included in this qualification and how 
many of these modules have been deemed as “at-risk”? 


Our first step in this journey is to enter DigitaLDiVA at one of the nodes. In this case, it is the qualifications 
node; however, before we explore this fully, we need to determine if there are any equivalents to the current 
qualification naming and number allocation. Specifically, how do previous versions of the qualification 
contribute to enrolments and graduates? This is explored through the Equivalents view, which in this case 
shows us that while there is a previous equivalent qualification, only one student from the previous 
qualification is still busy completing and students will no longer be able to enrol in the old qualification 
code. The old qualification enrolment numbers have already been incorporated into the new code. We are 
thus free to explore this qualification employing only the new qualification code (Figure 6). 


1 Official audit data captured on a specific date for Higher Education Institution in South Africa is known as HEMIS 
(Higher Education Information Management System) data. This is the basis for subsidy allocation by government. 
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Mo 


College 

CESM1 


Oual 

Code 


Qua Irf ic-ation 


t CED'U ajm? 97M2 - HIGHER CERTIFICATE IN ADULT BAS1C 
(7) EDUCATIO' M AMD TRAINING-HC AB ET 

; Replaces 8B815) No Research 


2 CEDU g,Bo1S MfilS-HIGHER CERTIFICATE In Adul! Base 

Educafcn and Train ng - HCERT AB ET•; Rea aoed 
by 97042} ■ No Research 



37S42 - Registered 


Category 

7 

2-m 

2™ 

241141 

24111 

24112 

24113 

24114 

24113 

241141 

24117 

24113 

■ 97942 




2903 

3929 

5920 

2953 

1 

1 




■ 98815 






1 

,3703 

10700 

10155 




| Totel 




2903 

3929 

5921 

■ ■ 399 

10791 

10150 





97942 replaces 90615. 


Figure 6. Three Excerptsof the Digital DNA 2 equivalence view. 

Now that we have identified the relevant qualification code, we can examine this data-level further. Our 
first stop is the attrition view: What is the pool of interest and potential uptake? Here we have an overview 
of the various types of attrition taking place (see Figure 7). 


No 

eo'lege Qua 1 - Description <|*22^) 

Formal Actual 

Framework Period ^ 

Withdrawn Cancelled Temporary Registered 

(TotN| <TotN| (TotN| (TotN) 

(Ave N| (Ave N| (Ave N| (Ave N) 

(Rateti) |Rate%) (Rate%| (Rate%) 

Attrition 

fTotN) 

(AveN) 

|Rate%) 

Active 

HEMIS (TotN) 

Period (Ave N| 

|Rate%| 

Converted 

(TotN) 

(Ave N| 
(Rate %) 



CEDU 97942 - HIGHER CERTIFICATE N ADULT 3ASC 
EDUCATION AND TRAINING -HCA3ST Reolaces 
988151 No Research 

Natona l-fcgher 
Cert*caie 

0*5) 
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Figure 7. Attrition view. 


2 All Excerpts are from the DigitalDNA System from this point forward. 
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The attrition view considers data from applications through to final registrations and active students 
submitted for statutory purposes. This view enables a rich understanding of the various points of attrition 
along that trajectory. 


Our next point of interest is to see the inflow/outflow and planning over time, thus the Qualification Flow 
Planning View. We can explore current and historical enrolment data: What are the inflows and outflows - 
intake, first time intake, returning, dropouts and graduates? We can also explore scenarios to achieve 
growth: How many provisional enrolments are required to achieve the official census date targets and what 
is the required workload? The two excerpts from the Qualification Flow Planning view are provided below 
in Figures 8 and 9. 


Actual Inflow & Outflow 


Category 

2011 

2012 
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Figure 8. Excerpt qualification flow planning view - actual inflow and outflow. 
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Planned inflow & Outflow 
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Figure 9. Excerpt qualification flow planning view - planned inflow and outflow. 


As a next step in our exploration we wish to examine (profile) the students within this qualification. As with 
all the other views, the cohort view is highly flexible and the user can select the information he or she wishes 
to engage with (see Figure 10) 
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Cohort Profiles ... 



Qual Academic Peiiod 

Deg-ee Level 

Quail Completion Status Code 
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Quail Semester Type 
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Qua! Registration Method 
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Qual Date Of Registration 

Metro nality Continent 

Matric Certificate 


Quail Type Code 

Qual Registration Period 

Metro na lity SADC 

f Mbtrie Agg-egate 


Qua! Category Code 

Registration Day 

i^l Resdenoe Country 

i^i Grouped Matric Aggregate 
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Residence Continent 
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Q Delivery Method 
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Registration Status ‘Code 
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Chart View : * 2D 3D 

Close | Apply 

Disciplinary Flag 



Figure 10. Excerpt cohort view - profile display options. 

In this first selection, we are examining: What is the race, gender, matric score, and age distribution of the 
students in this qualification? (see output in Figure 11) 
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Demographic Group 


2015 Cohort by Demographic Group' 

Cohort Enrolments for Demographic Group =' African Female' {%) 
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Figure 11. Excerpts cohort view. 

Finally, we pay a visit to the Risk Management View: What modules are included in this qualification and 
how many of these modules have been deemed as “at-risk”? What risk aspects are defined for this 
qualification? The main Risk Management View condenses multiple indicators into one succinct view. In 
our case we are interested in which modules have been deemed “at-risk” or “high-risk” (last 2 columns). We 
are also interested in any modules included in these qualifications which have seen a decrease in the Normal 
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exam pass rate (NPR)3. These three modules can thus be identified for additional support to improve the 
pass rate, (see Figure 12) 



Figure 12. Excerpts risk management view. 


Discussion 

This data exploration process takes place in real-time during the college planning workshop, ensuring 
relevant, accurate, and timely data is available to make the decisions. As the exploration progresses, the 
participants may explore certain avenues and determine what is or is not relevant to the discussion and 
problem at hand. The data exploration is logged and exported into a report to support and document the 
decisions taken at the workshop. Based on the comprehensive real-time exploration the group could 
establish the pool of interest, pool of suitable candidates, points of possible attrition, geographical 
distribution of students, student profiles, and success and barriers to graduation. With this information it 
is possible to decide whether the envisaged enrolment target is feasible, and if so, what is required to achieve 
it. 

In this example the participants in the workshop decide that although the enrolment in this qualification 
has decreased over the last few years, there is a high enough pool of interest that the 2016 target could be 
achieved. It is clear, however, that some interventions will be required to attain this target. The attrition 
view shows that while there is a high interest in the qualification (80,195 applications), this does not 
necessarily convert into enrolments (36,759 registrations); this may be because many students see this 
qualification as a second or third choice. The attrition rate (from application phase) for this qualification is 
very high (54%) and there are also a number of high-risk modules in the qualification. Providing additional 
support for these modules may ameliorate some barriers to success for this qualification. The envisaged 
2016 enrolment target is thus approved with the following interventions to be put in place: 

• Marketing: 

o General marketing to increase interest and applications for the qualifications. 


3 Normal exam pass rate - the number passed relative to those that wrote (no deferments) - the field shows the 
previous year pass rate, an indicator of change (up or down) and the current year pass rate 
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o Targeted marketing aimed at students who have applied for this qualification in order to 
increase the conversion of applications into registration. 

• Resource allocation: 

o Assigning additional online tutors to support the projected increased enrolments, 
o Assigning face-to-face tutoring for students in “at-risk” modules. 

o Scheduling additional face-to-face tuition sessions based on the geographical distribution 
data available. 

Catering for Various Users 

The example provided above was for management planning purposes at a high level. However, DigitaLDiVA 
is designed in such a way to allow for multiple levels of exploration for various users. Exploration must be 
made possible for users dealing with various entry nodes and users who have varying levels of data literacy 
and knowledge of the system. In addition, role based access is facilitated to ensure dissemination of 
appropriate data and information to users. As such, three supporting features are essential: the entry node 
dashboard (data-level floor plan), providing suggestions for exploration (suggested itineraries), and the 
ability to move between the various data-levels at will (catalysts or elevators). These are discussed shortly. 

Node Dashboards 

Users can access DigitaLDiVA at three nodule entry points: student, module, and qualification. The node 
dashboard is the first data view that the user will be confronted with once they enter a data-level. This will 
provide a quick overview of the data available at that level with click through and click around capabilities. 
This is a simple diagnostic view which allows the user to see which aspects they are concerned about for the 
student, module, or qualification they are exploring and provide them with the opportunity to visit different 
aspects for more information about these areas of concern. An example of the student node dashboard is 
shown in Figure 13 below to illustrate this. The data drawn for this dashboard is from the same DigitalDiVA 
system and users can easily click through and explore to a more aggregated level from here all the way to 
the qualificiation level where we started our illustrative case. 

Student Progress & Risk Reporting ( DIA-SM ver 2.2 - rel 20.05.2015 ) 

Home Dashboard Reports ▼ Tabular Reports ▼ Administrator (4) ▼ User ▼ 



UNISAST 
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Figure 13. Excerpt student node dashboard. 


Information is “on demand”; the various sections open depending on the interest and requirements of the 
user. An example of an “expanded” section for habits and behaviours alone is given in Figure 14. A multitude 
of data is thus available for this student on demand. 
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Figure 14. Excerpt student node dashboard expanded for habits and behaviour. 

Suggested Itineraries 

DigitaLDATA can be quite overwhelming for new users, particularly as most users move from having difficulty 
to access any data to having a wealth of navigable data at multiple levels of granularity and aggregation at 
their fingertips. To support users who are making their first forays into exploring Digital DNA, certain 
suggested itineraries have been developed for common types of explorations (Table 3). 
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Table 4 


Example Itineraries 


Reason for 
exploration 

Suggested view points 

Quality assurance 

Qualification or Module Quality Assurance Metrics or Aggregated 
Quality Assurance Metrics. 

Risk analysis 

Qualification Risk Rankings; Module Risk Identification; Module “At- 
Risk” Report; Student Progression Risk. 

Profiling 

Qualification Cohort Profile; Module Cohort Profile; Current and 
Planned Enrolment Profile; Current Student Profile; Student Habits 
and Behaviour Profile. 

Predictive 

analytics 

Qualification Inflow/Outflow Modelling; Qualification Retention and 
Success Predictions; Module Attrition and Success Predictions. 

Success analyses 

Qualification Throughput and Success; Module Examination Success. 


These itineraries suggest certain viewpoints which may be of value when confronted with various decisions 
and challenges. Of course the user may deviate and use their own exploration, but it is suggested that at 
minimum these views are explored. 


Catalysts 

The real power of Digital DNA is the ease of navigation, not just within each data-level, but between the 
various levels. This is facilitated through a navigation bar consisting of a collection of icons that allows the 
user to jump from one data layer to another at will to further the data exploration. These bars are located 
in various places in the design of the dashboard (see Figure 15 below). It is also possible to click on any of 
the modules in the curriculum window to act as catalysts to explore the module level in depth. 
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Academic Year: 2015 
Qualification Code: 98615 

Qualification: HCERT ABET 
Qualification Desc: HIGHER CERT ABET 

Qualification Long Desc: HIGHER CERTIFICATE In Adult Basic Education and Training 
Qual Type Code: C 
Highest Qualification: Certificate 
Formal: Formal 
Qual Category Code: 41 

Qual Category: Higher Certificate 
Univer Tech Qual: HEQF 
Qual College Code: 9 

Qual College: COLLEGE OF EDUCATION 
Qual College Abbreviation: CEDU 
Qual CESM1 Code: 7 
Qual CESM2 Code: 10 
Qual CESM3 Code: 1 

Qual CESM1: Education 

Qual CESM2: Teacher Education And Professional Development, Specific Levels And Methods 
Qual CESM3: Adult Education And Training 
Major Field Of Study: EDUCATION 
Set Group: EDUCATION 
Nat Group: HUM 
CESM Handling Ind: 1 
CESM Category: 7 
CESM Sub Category: 99 
Degree Level Code: U 

Degree Level: UnderGraduate 
Classification: SF 
Minimum Time: 1 


Curriculum 2015 

ODS 

^ PRE SME QAD RSK BIO GEO 
Modules : 10 



Figure 15. Example report with various navigation bars. 


In the the screen shown in Figure 15 not only the navigation bar is click-able, but any user can click through 
from this qualification level to any of the modules on the right. This would allow for exploration of the 
various modules contributing to the overall qualification. 


Conclusion 

The Digital DNA approach to data exploration presented in this article represents several shifts in the way 
that dashboard design and visualisation is usually approached. From a design perspective, a distinction is 
drawn between visualisations for learning analytics and for broader applications as decision support 
systems (DSS). While the DigitalDNA development has the student as an identified agent, the construct is 
more comparable with DSS development than with learner analytics visualisations. The DigitalDNA 
development can be classified as part of the Data-driven DSS toolset as described by Kacprzyk and Zadrozny 
(2007). The primary design concern is data integrity and linkage with a number of centrally enriched data 
sets. This means that exploration logic now becomes the focus as opposed to the needs of a particular user 
or the capabilities and interface of any particular dashboard development software. The focus is also not on 
data visualisation and memorability, but rather on having real-time navigable data with various displays 
for users to choose from. In addition, effort is given to link multiple data points in a way that is logical to 
the user. The user may thus identify and explore areas of concern of data surrounding key nodes, with the 
ability to extensively drill through, drill down, and aggregate upwards. The shift is thus from canned (pre- 
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packaged) reporting in the form of dashboards to the flexibility of exploration (on demand) of data at will 
with an export functionality to capture the real-time exploration. 

DigitalDNA approach caters for the highest level of user with the highest level of data literacy. It becomes a 
bank of all the possible data available. Any subset of this data can now be easily drawn into dashboards and 
score cards for users with lower levels of data literacy and more basic data needs. As all data is located in 
one warehouse with enrichment taking place prior to extraction, it becomes easier to apply business rules 
consistently and ensure data quality. This approach to data exploration has the added benefit of shifting 
from a situation where data is pushed onto the users to a pull approach where users can identify and explore 
only the data which is of real concern to them. Consult Table 5 for a summary of the shifts in approach. 


Table 5 

Shifts from Dashboards to DigitalDNA 


Dashboard 

DigitalDJVA 

Memorability & visualisation 

Navigability & flexibility 

Needs and requirements of user 

Exploration logic 

Capabilities of particular tool 

Any software can overlay 

Sub-sets of purposefully enriched data 

Integrity and linkages of enriched database 

Pre-packaged reporting (Push) 

Reporting on demand (Pull) 


The next step in the research and development will focus on visualisation in order to make navigation and 
exploration more intuitive. There will also be a focus on how to help users to identify possible problem areas 
for further exploration more easily out of the node dashboards. 


The aim of this article is not to market a particular application, but to illustrate a new approach to present 
vast quantities of data to users for sense-making. It attempts to show the shifts in design logic required to 
make data exploration and navigation more accessible and dynamic as the data available in Higher 
Education mushrooms. It emphasises a shift from a fragmented user focused dashboard development 
approach to networked data exploration which employs dynamic dashboards to facilitate data exploration 
and navigation. This represents a paradigm shift from the traditional approach which emphasizes 
memorability and visualisation. 
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